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Summary 


The genomic RNA of the coronavirus IBV contains an 
efficient ribosomal frameshifting signal at the junc- 
tion of two overlapping open reading frames. We have 
defined by deletion analysis an 86 nucleotide se- 
quence encompassing the overlap region which is 
sufficient to allow frameshifting in a heterologous 
context. The upstream boundary of the signal con- 
sists of the sequence UUUAAAC, which is the likely 
site of ribosomal slippage. We show by creation of 
complementary nucleotide changes that the RNA 
downstream of this “slippery” sequence folds into a 
tertiary structure termed a pseudoknot, the formation 
of which is essential for efficient frameshifting. 


Introduction 


Ribosomal frameshifting is a recently described mecha- 
nism of translational regulation in which a directed 
change of translational reading frame allows the synthesis 
of a single protein from two (or more) overlapping genes 
(see Roth, 1981, Craigen and Caskey, 1987, for reviews). 
So far, almost all the examples of this kind of control in 
higher eukaryotes have come from retroviruses, where 
frameshifting appears to be a mechanism for the regula- 
tion of expression of the viral RNA-dependent DNA poly- 
merase. One termination codon in Rous sarcoma virus 
(RSV) (Jacks and Varmus, 1985) and human immunodefi- 
ciency virus type 1 (HIV-1) (Jacks et al., 1988a), and two 
in mouse mammary tumor virus (MMTV) (Moore et al., 
1987; Jacks et al., 1987) are suppressed by (-1) ribosomal 
frameshifts into alternative overlapping open reading 
frames (ORFs), generating the gag/po! polyproteins from 
which the viral polymerases are subsequently derived. In 
addition, frameshifting appears to be necessary for the 
expression of the reverse transcriptase enzymes of a 
number of retrotransposons, such as yeast Ty1 (Mellor et 
al., 1985; Wilson et al., 1986; Clare et al., 1988) and Ty912 
(Clare and Farabaugh, 1985) and the Drosophila 176 
(Saigo et al., 1984) and gypsy (Marlor et al., 1986) ele- 
ments. 

Recently, however, we described the first non-retroviral, 
higher eukaryotic example of the phenomenon, in the 
avian coronavirus infectious bronchitis virus (IBV) (Brier- 
ley et al., 1987). IBV, the type species of the Coronaviridae 
(Siddell et al., 1983), has a large, single-stranded, continu- 
ous RNA genome of positive polarity (Schochetman et al., 
1977; Stern and Kennedy, 1980). Three-quarters of the 


genomic coding capacity is contained in two large ORFs, 
F1 and F2, which potentially encode 440 kd and 300 kd 
polypeptides respectively (Boursnell et al., 1987). The 
ORFs of F1 and F2 overlap by 42 nucleotides, with F2 in 
the —1 frame with respect to F1 (see Figure 1). Using in 
vitro transcription and translation, we demonstrated that 
F2 is produced as a fusion protein with F1 as a result of 
a highly efficient (25%-30%) (-1) ribosomal frameshift 
that suppresses the F1 termination codon (Brierley et al., 
1987). The role of this translational mechanism in IBV 
replication is so far unknown, but since the F1 and F2 ORF 
are assumed to encode, at least in part, components of 
the virus-specific RNA-dependent polymerase, and since 
the predicted amino acid sequence of F2 contains regions 
in common with other positive strand virus replicases 
(Boursneil et al., 1987; Hodgman, 1988), it may be that the 
overall strategy is similar to that understood for retro- 
viruses. 

There is considerable interest in the precise mecha- 
nism by which ribosomal frameshifting operates, and 
work from several groups has shown that the specificity 
for the event resides in the nucleotide sequence of the 
RNA around the site at which frameshifting occurs, since 
it has proved possible to induce frameshifting in a heterol- 
ogous context by inserting cloned DNA corresponding to 
the frameshift site into unrelated genes (Brierley et al., 
1987; Jacks et al., 1988b). For RSV and HIV, the site at 
which frameshifting occurs has been identified by a com- 
bination of site-directed mutagenesis and amino acid se- 
quence analysis (Jacks et al., 1988a, 1988b). These 
authors, from a comparative analysis of a large number of 
retroviruses thought to utilize frameshifting as a means of 
controlling gene expression, have suggested that certain 
heptanucleotide RNA sequences, initiating with two 
homopolymeric triplets, can allow tRNA slippage during 
translation, leading to a (-1) frameshift. In addition to 
these “slippery” sequences, potential RNA stem—loop 
structures located downstream of most retroviral shift 
sites have been proposed to make contributions to the 
frameshifting process (Rice et al., 1985; Moore et al., 
1987; Jacks et al., 1987, 1988a), and indeed, the presence 
of a stem-loop downstream of the RSV frameshift site has 
been shown to be essential (Jacks et al., 1988b). How the 
stem-loop influences frameshifting is not known, but it 
has been suggested (Jacks et al., 1988b) that ribosomes 
may slow or stall at the stem-loop, increasing the likeli- 
hood of a tRNA slippage event. 

In this paper we describe analysis of the RNA se- 
quences responsible for frameshifting in IBV, and define 
the minimal amount of information from the F1/F2 overlap 
that is sufficient to induce frameshifting at high efficiency. 
The upstream limit of this essential region has the charac- 
teristics of a “slippery” sequence as defined previously 
(Jacks et al., 1988b), and mutational analysis of this se- 
quence is entirely consistent with frameshifting occurring 
at this site. The remainder of the essential region consists 
of about 80 nucleotides immediately downstream of the 
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Figure 1. Ribosomal Frameshifting In Vitro 


—— (ann 3° and In Vivo 


(A) Diagram of plasmid pFS7 showing the 
predicted sizes of protein products that would 
be expected following ribosomal frameshifting 
within the F1/F2 overlap during translation of a 
mRNA derived from Smal-digested pFS7 tem- 
plate. (B) Left-hand panel: reticulocyte lysate 
and Xenopus oocyte translation products syn- 
thesized in response to RNA transcribed from 
Smai-digested pFS7. RNA was translated and 
products were labeled with [25S]methionine as 
described in Experimental Procedures. Poly- 
peptides were separated on a 10% SDS-poly- 
acrylamide gel and detected by autoradiogra- 
phy. Right-hand panel: reticulocyte lysate 
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“slippery” sequence, and we present evidence that effi- 
cient frameshifting depends on the formation, by these 
sequences, of a tertiary RNA structure in the form of a 
“pseudoknot” (Studnicka et al., 1978; Pleij et al., 1985). 


Results 


Ribosomal Frameshifting In Vitro and In Vivo 

A ribosomal frameshifting mechanism for the expression 
of the IBV F2 ORF was first suggested by Boursnell and 
colleagues (1987). As no products encoded by F1 and F2 
have been detected in IBV-infected cells, we tested this hy- 
pothesis by in vitro transcription and translation of acDNA 
containing the F1/F2 overlap region cloned within a 
marker gene (PB2 of influenza virus A/PR8/34) under the 
control of the bacteriophage SP6 promoter (Brierley et al., 
1987). Translation of synthetic mRNA derived from this 
plasmid (pFS1) in vitro yielded products whose size and 
pattern of immunoprecipitation (with specific antisera 
raised against defined portions of the marker protein) 
were entirely consistent with an efficient (-1) ribosomal 
frameshifting event at the end of the F1 ORF, resulting in 
the production of an F1-F2 trans-frame protein (Brierley et 
al., 1987). These earlier studies, however, did not test if the 


frameshift signal was functional in vivo. To address this 
question, we wished to test the capacity of our synthetic 
mRNA to signal frameshifting in Xenopus oocytes after 
microinjection. However, since the influenza PB2 protein 
is relatively unstable in Xenopus oocytes, we constructed 
an alternative recombinant plasmid, pFS7, in which the 
IBV F1/F2 overlap was cloned within the PB1 gene of in- 
fluenza virus, which in turn was flanked by the 5’ and 3’ 
noncoding regions of the Xenopus B-globin gene, down- 
stream of an SP6 promoter (Krieg and Melton, 1984) (Fig- 
ure 1A). Linearization of this plasmid with Smal, followed 
by transcription, resulted in the production of a capped 
and polyadenylated 2.8 kb mRNA, designed such that on 
translation, ribosomes terminating at the F1 ORF stop 
codon would produce a 45 kd product (the “stopped” prod- 
uct) and those that frameshifted, a 95 kd product (see Fig- 
ure 1A). Since the region of IBV cDNA included in pFS7 
was considerably shorter than that in pFS1 (230 bp, as op- 
posed to 497 bp,), we first tested that the pFS7-derived 
mRNA contained a functional frameshift signal by trans- 
lating it in the reticulocyte lysate system. As can be seen 
in Figure 1B, frameshifting occurred at wild-type (25%) ef- 
ficiency. The predicted 45 kd “stopped” product migrates 
as a somewhat larger species (about 50 kd) on SDS-poly- 
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acrylamide gels, probably a consequence of the highly 
basic nature of the PB1 protein. As expected, antiserum 
asi, raised against the N-terminus of PB1, immunoprecipi- 
tated both the 45 kd and 95 kd products, indicating com- 
mon N-termini, and the C-terminal antiserum, as4, only 
the 95 kd product. When the same mRNA was microin- 
jected into Xenopus oocytes, frameshifting was also ob- 
served, with the stopped and frameshifted products 
clearly visible against the background of labeled oocyte 
proteins (see Figure 1B). The efficiency of frameshifting 
in vivo was similar to that seen in the reticulocyte lysate. 


Defining the Upstream Boundary of 

the Frameshifting Signal 

Retroviral frameshifting appears to involve at least two 
components, a “slippery” sequence, which is the site of 
tRNA slippage, positioned upstream of a potential stem- 
loop structure in the mRNA (Jacks et al., 1988b). Analysis 
of the IBV F1/F2 overlap reveals a similar sequence motif, 
UUUAAAC, some 6 bp upstream of a potential stem-loop 
(see Figure 2 and Brierley et al., 1987). Based on the origi- 
nal hypothesis of Jacks and Varmus (1985) on the mecha- 
nism of frameshifting in RSV, we suspected that the site 
of frameshifting in IBV would be the UUUA sequence, with 
a tRNA slipping from the UUA codon back to the UUU 
codon in the —1 frame during the frameshifting process. 
This possibility was tested by site-directed mutagenesis 
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Figure 2. Upstream Components of the IBV 
Frameshifting Signal 

(A) The position of specific deletions and point 
mutations created within the F1/F2 junction re- 
gion of plasmid pFS7 by site-directed mutagen- 
esis. The diagram shows part of the sequence 
of the RNA around the frameshift site, includ- 
ing the suspected “slippery” sequence UUU- 
AAAC (boxed) and the predicted stem-loop 
structure downstream. Shaded areas repre- 
sent regions of the influenza PB1 gene, and un- 
shaded areas, IBV-derived sequences, except 
for a linker sequence (boxed) that was derived 
from phage m13 during the cloning procedure 
(see Experimental Procedures). (B) Reticulo- 
cyte lysate translation products synthesized in 
response to mRNAs derived from Smal- 
digested pFS7 or mutant templates. Polypep- 
tides were labeled with [°°S]methionine, sepa- 
rated on a 10% SDS-polyacrylamide gel, and 
detected by autoradiography. 


linker = pB1 <C> 


using pFS7, which contains the bacteriophage f1 inter- 
genic region in a nonessential region of the vector (see Ex- 
perimental Procedures), and which can thus be converted 
to single-stranded DNA (Dotto et al., 1981). We first cre- 
ated a deletion from within the upstream PB1 sequences 
up to, but not including, the UUUAAAC motif of the IBV 
frameshift region (pFS7.3), and another from the same 
point to just beyond the UUUAAAC motif (pFS7.4) (Figure 
2). In each case the deletions were designed to preserve 
the upstream and downstream open reading frames pres- 
ent in the original plasmid. Synthetic mRNA was tran- 
scribed from these plasmids and tested for frameshifting 
by in vitro translation (Figure 2). The results confirmed the 
suspicion that the UUUAAAC motif was important, since 
frameshifting occurred with wild-type efficiency in pFS7.3 
but was totally abolished with pFS7.4. 

To investigate in more detail the role of this sequence, 
we made single base substitutions at certain positions 
within this region. In the first mutagenic construct, pFS7.1, 
UUUA was changed to CUUA, on the basis that a UUA to 
CUU tRNA!" slippage event ought to be unfavorable. As 
can be seen in Figure 2, with this mutant, frameshifting 
in the reticulocyte lysate system was reduced approxi- 
mately 10-fold, supporting the idea that a tRNA slips at 
this site. However, as frameshifting was not completely 
abolished, we tested whether an additional site of slip- 
page was involved, at the adjacent AAC codon. To avoid 
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Figure 3. Downstream Components of the IBV Frameshifting Signal 


(A) The position of specific deletions and a point mutation created within the F1/F2 region of plasmid pFS7 by site-directed mutagenesis. The diagram 
shows part of the sequence of the RNA from the overlap region. The underlined region highlights nucleotides with the potential to base pair with 
complementary nucleotides in the loop of the stem-loop structure predicted to form upstream. 

(B) Reticulocyte lysate translation products synthesized in response to MRNAs derived from Smal-digested pFS7 or mutant templates. Polypeptides 


were labeled and analyzed as described in the legend to Figure 2. 


perturbing the UUUA slip site, we altered the second 
nucleotide of the AAC codon to either U (pFS7.14) or C 
(pFS7.17) and tested for frameshifting (Figure 2). 
Frameshifting was not observed with pFS7.14 (UUU- 
AUAC), and was greatly reduced (to about 0.5%) with 
pFS7.17 (UUUACAC). The great reduction in frameshifting 
efficiency arising from these changes suggests that the 
tRNAs specified by the mutant sequences, i.e., tRNA 
(for pFS7.14) and tRNA"'s (for pFS7.17) are unable to shift 
into the -1 frame. This is not unreasonable, since such 
a shift would require the tRNA anticodon, AUG, to base 
pair with AUA in the mutant mRNA, and the tRNAH's an- 
ticodon, GUG, to pair with ACA (the anticodon sequences 
shown are based on standard Watson—Crick base pair- 
ing). In the pFS7.1 mutant, where low-level frameshifting 
was seen, the tRNA“&4 anticodon, AAU, could form one 
base pair with the mutant codon CUU; this may be suffi- 
cient to allow a low efficiency slip. Recently, Jacks and col- 
leagues (1988b) have proposed that in RSV, a double 
tRNA-slippage event occurs at the frameshift site, in 
which two adjacent ribosome-bound tRNAs slip into the 
-1 frame at the sequence AAAUUUA. Our results with 
IBV are consistent with this simultaneous slippage model, 
with the adjacent tRNA‘ and tRNA4®" slipping back by 


one nucleotide in the 5’ direction at the UUUAAAC site in 
IBV. 


Requirement for Downstream Sequences 

We next created a series of deletions downstream of the 
“slippery” sequence to define the 3’ limit of the frameshift- 
ing signal (Figure 3). Once again deletions were made by 
oligonucleotide-directed mutagenesis and were designed 
to preserve the original open reading frame. We expected 
on the basis of previous work that this analysis would re- 
veal a requirement for the nucleotides predicted to form 
a stem-loop structure just downstream of the slippery se- 
quence, and indeed, pFS7.6, in which one side of the 
predicted stem-loop was removed, displayed virtually no 
frameshifting. This reduction was not simply due to the 
removal of the F1 stop codon in pFS7.6, since alteration of 
the stop codon from UGA to UGU by mutagenesis in a 
separate construct, pFS7.2, resulted in a mRNA with 
slightly increased frameshifting efficiency (Figure 3). In 
this case a new termination codon is used which is located 
some 36 nucleotides downstream. Thus it seems that pre- 
cise positioning of a terminator is not a prerequisite for 
frameshifting, in agreement with the situation for RSV 
(Jacks et al., 1988b). Surprisingly, pFS7.5, which pre- 
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Figure 4. A Tertiary RNA Structure Forms in the IBV Frameshift Region 


(A) Diagram showing the predicted base pairing interactions within the RNA at the frameshift site in the wild-type (pFS7/WT) and in six mutant con- 
structs prepared by site-directed mutagenesis. The mutations in constructs pFS7.18, pFS7.19, and pFS7.20 were in the first stem (PK1/PK3), and in 
constructs pFS7.12, pFS7.15, and pFS7.16, the putative second stem (PK2/PK4) was altered. 

(B) Reticulocyte lysate translation products synthesized in response to MRNAs derived from Smal-digested pFS7 or mutant templates. Polypeptides 


were labeled and analyzed as described in the legend to Figure 2. 


serves the potential for stem-loop formation, also showed 
greatly reduced frameshifting, although in this case a re- 
sidual amount, about 2%-3%, was still observed. In light 
of this observation, we considered the possibility that an 
additional interaction may occur between the stem-loop 
structure and nucleotides downstream to create a tertiary 
structure in the MRNA. Structures of this kind have been 
proposed to occur in a number of positive-stranded viral 
RNAs (see Pleij et al., 1985; Clarke et al., 1987) and have 
been termed “pseudoknots.” The formation of a pseu- 
doknot necessitates the base pairing of nucleotides in a 
hairpin loop with nucleotides downstream, generating an 
extended quasi-continuous double helix. Close examina- 
tion of the IBV F1/F2 mRNA sequence revealed a stretch 
of 7 nucleotides (underlined) about 30 nucleotides down- 
stream of the predicted stem-loop which could potentially 
base pair with a stretch of nucleotides in the single-strand 
region of the hairpin loop. As can be seen in Figure 3, 
frameshifting on pFS7.10-derived mRNA, which retains 
this sequence, occurred with wild-type efficiency. How- 
ever, when most of this sequence was deleted, in con- 
struct pFS7.11, the frameshifting efficiency was dramati- 
cally reduced, which indicates that the nucleotides from 
this region are crucial for efficient frameshifting. 


The Frameshifting Signal Contains 

a Tertiary RNA Structure 

The formation of a pseudoknotted RNA structure down- 
stream of the UUUAAAC sequence would require two dis- 


crete base pairing interactions within the mRNA (as 
shown in Figure 4), one which would form the original 
stem-loop (between PK1 and PK3), and a second be- 
tween the loop of this structure (PK2) and the AGCCU- 
containing region downstream (PK4). As a first step, we 
sought direct evidence for the PK1-PK3 interaction, to 
confirm the existence of the first stem-loop, by creating 
complementary mutagenic changes within the PK1 and 
PK3 regions (see Figure 4). In pFS7.18, we sought to 
destabilize the stem by replacing six nucleotides of the 
PK1 sequence at the base of the stem (GGGGUA) with 
their complementary nucleotides. In pFS7.19, the six 
nucleotides of the PK3 sequence at the base of the stem 
(CCCCAU) were replaced by complementary nucleotides. 
Finally, in pFS7.20, both mutagenic changes were made, 
to create a double mutant in which the changes whould 
recreate base pairing and restabilize the stem. In both the 
single mutants (pFS7.18 and 7.19), frameshifting was abol- 
ished, yet the frameshifting efficiency returned to near 
wild-type levels in the double mutant, pseudo-wild-type 
construct, pFS7.20 (Figure 4). This result provides strong 
support for the PK1i-PKS interaction, and demonstrates 
that the stem-loop plays an important role in the frame- 
shifting process. 

We then went on to investigate the proposed PK2-PK4 
interaction in a similar manner. !n this case, we mutated 
the relevant regions by reversing a block of five nucleo- 
tides within each region (see Figure 4). Thus in pFS7.15, 
the PK2 sequence was changed from AGGCU to UCGGA, 
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Figure 5. Proposed Folding of the RNA at the IBV Frameshift Region: 
Formation of an RNA Pseudoknot 

Base pairing between nucleotides in the loop of the stem-loop struc- 
ture and a region downstream (A) results in the formation of an ex- 
tended double helix, shown schematically (B). The double helical 
regions S1 and S2 are connected by single-stranded loops L1 and L2. 
In this structure, S2 is stacked upon S1 such that a right-handed, quasi- 
continuous double-helix of 16 bp is formed. An artists impression of 
the three-dimensional organization of this structure is shown (C), as- 
suming that 9 bp form in S1, 7 bp form in S2, and one turn of the helix 
contains 11 bp (Arnott et al., 1972). In the resulting pseudoknot, L1 (2 
nucleotides in length) crosses the deep groove and L2 (32 nucleotides 
in length) the shallow groove. The “fold” program of Jacobson et al. 
(1984) did not predict any significant RNA secondary structures within 
L2. These diagrams are based on those presented by Rietveld et al. 
(1983) and Pleij et al. (1985). 


and in pFS7.12, the PK4 sequence was changed from 
AGCCU to UCCGA. In both mutated constructs, where the 
potential for base pairing is dramatically reduced, the 
frameshifting efficiency was lowered some 10-fold to 
about 2%. However, as was the case with the PK1-PK in- 
teraction, the creation of the double mutant, pFS7.16, in 
which the changes are complementary, restored the effi- 
ciency of frameshifting to approximately wild-type level. In 
these experiments, it was consistently observed that the 
“stopped” product from pFS7.15 and pFS7.16 had a slightly 
lower gel mobility than that produced by the wild-type and 
pFS7.12 constructs. This altered gel mobility is most likely 


due to the two amino acid substitutions (serine and aspar- 
tic acid replacing arginine and leucine) arising from muta- 
tion of the PK2 sequence in these plasmids. These experi- 
ments provide strong evidence that a tertiary RNA 
structure, a pseudoknot, forms at the frameshift site. A 
model for such a structure is shown in Figure 5. 


The Stippery Sequence and Pseudoknot 

Are Precisely Positioned 

Jacks and colleagues (1988b) have proposed that RNA 
secondary structure downstream of the “slippery” site may 
slow or stail ribosomes during translation, such that the 
ribosomal peptidyl (P) and acceptor (A) sites remain in the 
vicinity of the slippery sequence slightly longer, promoting 
the slippage event. A prediction of such a model is that the 
correct spacing distance must be maintained between the 
slippery sequence and the RNA structure concerned. We 
tested this by constructing two mutants in which the spac- 
ing was increased (pFS7.13) or decreased (pFS7.22) by 
three nucleotides (Figure 6). In pFS7.13, in which a tyro- 
sine codon (UAC) was inserted between the slippery se- 
quence and pseudoknot, frameshifting was abolished. In 
pFS7.22, deletion of the glycine codon (GGG) greatly re- 
duced frameshifting (to about 2%). Thus alteration of the 
relative position of the UUUAAAC sequence by as little as 
three nucleotides with respect to the pseudoknot causes 
a dramatic reduction in frameshifting efficiency. 


Discussion 


The genomic RNA of the coronavirus IBV contains a se- 
quence at the junction of the F1 and F2 ORF that can di- 
rect ribosomal frameshifting both in vitro and in vivo in 
Xenopus oocytes. We have defined, by deletion analysis, 
a stretch of approximately 86 nucleotides within this re- 
gion that contains all the necessary information for high 
efficiency frameshifting in a heterologous context. This 
minimal frameshift signal includes, at its 5’ end, the hepta- 
meric sequence UUUAAAC, which appears to be anala- 
gous to the “slippery sequences” described by Jacks et al. 
(1988a, 1988b) required for frameshifting in retroviral sys- 
tems. In addition, frameshifting is dependent upon the for- 
mation of a tertiary mRNA structure downstream of the 
slippery sequence. This represents direct evidence that 
tertiary folding in RNA can infiuence its translation and 
could have implications for a much wider range of biologi- 
cal processes involving RNA. The frameshifting system 
thus provides an excellent opportunity to study in more 
detail this novel kind of RNA structure. 


The Site of Frameshifting in Coronavirus IBV 

Our results suggest strongly that frameshifting occurs at 
the UUUAAAC sequence. A deletion that removed this re- 
gion abolished frameshifting, and point mutations within 
the region were severely inhibitory. This sequence corre- 
sponds in organization to the “slippery” sequences pres- 
ent in the gene overlaps of demonstrated and suspected 
retroviral frameshift sites (Jacks and Varmus, 1988b). In 
the three retroviral overlaps where the site of slippage has 
been established by protein sequencing, the shift occurs 
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at the downstream codon in the slippery sequence, i.e., 
at the leucine codon in RSV (AAAUUUA) and HIV-1 
(UUUUUUA), and at the asparagine codon in MMTV 
gagipro (AAAAAAC) (Hizi et al., 1987). It seems likely 
therefore that the IBV shift occurs at the asparagine codon 
(UUUAAAC), although confirmation will require protein 
sequencing at the junction of the frameshifted product. 

Jacks et al. (1988b) have proposed a model for ribo- 
somal frameshifting in RSV which involves simultaneous 
slippage on the mRNA of tRNAs occupying both the P and 
Asites of the ribosome. They postulate that such slippage 
would be possible only if base pairing could subsequently 
be maintained between the first two positions of each an- 
ticodon loop and the new mRNA codon. Our data for IBV 
frameshifting are consistent with such a model. Following 
slippage, base pairing could be maintained between the 
first two nucleotides of each anticodon loop and the 
mRNA (with the post-slippage anticodons AAU and UUG 
pairing with the mRNA codons UUU and AAA, respec- 
tively, after slippage). If this model is correct, it might be 
expected that reducing the capacity to form base pairs in 
the “slipped” position would affect frameshifting dramati- 
cally, and this indeed was observed. Two single site muta- 
tions (UUACAC, pFS7.14 and UUUAUAC, pFS7.17), which 
would prevent tRNA-mRNA pairing in the A site following 
slippage, greatly reduced or abolished frameshifting 
(0.5% or less). In mutant pFS7.1 (CUUAAAC), in which 
complementarity in the “slipped” position would be re- 
duced from 2 to 1 bp in the P site, frameshifting was again 
markedly reduced, although in this case, some activity 
was retained (efficiency of 2%). This difference in frame- 
shifting efficiency between the A and P site mutants may 
reflect the number of base pairs that can form following 
slippage at each site. Alternatively, the tRNA reading the 
Asite may play a more important role in frameshifting than 
that occupying the P site, as suggested by Jacks and col- 
leagues (1988b) for RSV and other retroviral frameshift 
sites. 


Figure 6. The Effect on Frameshifting Effi- 
ciency of Changing the Relative Position of the 


x 9 “Slippery” Sequence with Respect to the Ter- 


tiary RNA Structure 

(A) Diagram showing two mutant derivatives of 
pFS7 in which the distance between the UUU- 
AAAC sequence (boxed) and the tertiary RNA 
structure was increased (pFS7.13) or de- 
creased (pFS7.22) by three nucleotides. 

(B) Reticulocyte lysate translation products 
synthesized in response to mRNAs derived 
from Smal-digested pFS7 or mutant templates. 
Polypeptides were labeled and analyzed as de- 
scribed in the legend to Figure 2. 
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Evidence for Tertiary RNA Folding 

in the IBV Frameshift Signal 

Our deletion analysis of the frameshift signal of IBV indi- 
cates the requirement not only for the UUUAAAC ‘“slip- 
pery sequence” at the 5’ end, but also of a stretch of 
nucleotides downstream. We previously suggested that a 
stem-loop structure predicted from the primary sequence 
of this downstream region might be an important element 
of the frameshift signal (Brierley et al., 1987), and the 
results reported here are consistent with this. However, 
our experiments indicate that this stem-loop forms part of 
a more complex tertiary structure, since base pairing was 
apparent between nucleotides of the predicted loop and 
a region downstream. Until direct structural information is 
obtained, the precise three-dimensional organization of 
the RNA remains uncertain. However, we believe that the 
RNA folds to form a structure similar in principle to the 
pseudoknots predicted to occur in a number of plant viral 
RNAs (see Pleij et al., 1985), foot and mouth disease virus 
(FMDV) RNA (Clarke et al., 1987), E. coli 5S RNA 
(Goringer and Wagner, 1986), E. coli 16S rRNA (Maly and 
Brimacombe, 1983; Pleij et al., 1985), and in the E. coli a 
MRNA leader sequence (Deckman and Draper, 1987). A 
similar kind of structure has also been described for a 
short synthetic RNA (Puglisi et al., 1988). The construc- 
tion of the proposed IBV pseudoknot is outlined in Figure 
5. A stretch of nucleotides in the loop of the hairpin and 
bordering the stem region form normal Watson-Crick 
base pairs with nucleotides of the complementary se- 
quence located downstream in the RNA. Two separate 
double-helical regions form two stem structures (marked 
$1 and S2), which stack coaxially to generate an elon- 
gated double-stranded helix. It has been assumed (Pleij 
et al., 1985) that in interactions of this kind, the stems are 
stacked on top of each other such that a quasi-continuous, 
right-handed double helix is formed, comparable to A-RNA. 
Such an assumption is valid if the single stranded con- 
necting loops L1 and L2 pose no sterical constraints upon 
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this structure (for a detailed discussion, see Pleij et al., 
1985). The end product of the folding is a pseudoknot 
(Studnicka et al., 1978) in which the connecting loops L1 
and L2 span the deep and shallow grooves of the double- 
helix, respectively. 

The IBV stem-loop predicted by the computer-based 
RNA folding program of Jacobson et al. (1984) (Figure 3) 
has 11 bp in the stem and 5 unpaired nucleotides in the 
loop. As it stands, it seems unlikely that this precise struc- 
ture could form a pseudoknot, since all the loop nucleo- 
tides are predicted to base pair with elements down- 
stream, and no unpaired nucleotides remain to cross the 
deep groove of the double-stranded helix. To account for 
the results of our mutagenic analysis of the region, we 
suggest that an alternative stem-loop forms (shown in 
Figure 4) which, because a number of previously paired 
stem nucleotides now form part of the single-stranded 
loop, is capable of the tertiary interaction. In this model, 
the weak UG pair and the GC pair at the top of the stem 
are included in the loop (as is the previously unpaired G). 
This would potentially allow seven of the nine loop nucleo- 
tides to base pair with the downstream region to form S2, 
leaving two unpaired loop nucleotides to span the deep 
groove in the pseudoknot. The bridging of 7 bp in the deep 
groove by two loop nucleotides is sterically feasible (see 
Pleij, 1985). An artist's impression of the three-dimension- 
al folding of the proposed IBV pseudoknot is shown in Fig- 
ure 5. A quasi-continuous double helix of 16 bp in length 
is formed. As is the case in a number of the plant viral 
pseudoknots, loop 2 is considerably longer than loop 1. As 
only 7 bp can form between the loop and the downstream 
sequence, it is unlikely that a real knot could form in the 
RNA, since a full turn of an A-RNA duplex contains 11 bp 
(Arnott et al., 1972). 

Our results therefore suggest strongly that an RNA 
pseudoknot is an essential element of the IBV frame- 
shifting signal, although its precise organization remains 
to be elucidated. However through direct structural analy- 
sis of artificially synthesised wild-type and mutated 
mRNAs in combination with functional analysis by in vitro 
translation, it should be possible to investigate in consid- 
erable detail this novel RNA building principle. 


The Role of Tertiary RNA Structure 

in Ribosomal Frameshifting 

How does the presence of a pseudoknot in IBV mRNA 
lead to efficient frameshifting? The answer to this ques- 
tion is not yet clear, but an obvious possibility is that the 
elongating ribosome may be required to unwind the pseu- 
doknot, and this could slow or stall its passage along the 
mRNA sufficiently to allow tRNA slippage at the “slippery” 
site. In this case it might be expected that mutations which 
destabilize, but do not abolish entirely, secondary and ter- 
tiary structure in the downstream region would be partially 
inhibitory to frameshifting. Indeed, in the case of mutants 
where the first predicted stem (S1) was allowed to form but 
not the second (S2), that is pFS7.5, pFS7.11, pFS7.12, and 
pFS7.15, frameshifting was detected, although at a much 
lower level. This raises the possibility, currently being 
tested, that intermediate levels of frameshifting would be 


observed if one or other of the two stems were only par- 
tially destabilized. Whatever model is advanced to explain 
how a pseudoknot promotes frameshifting must account 
for the precise spacing required between the knot and the 
“slippery” sequence, since deletion or insertion of as little 
as three bases was sufficient to reduce frameshifting dra- 
matically. A possible explanation for this is that in these 
mutants, the ribosomal P and A sites are positioned on se- 
quences that are refractory to slippage during a ribosomal 
pause. 

A second important question arising out of this work is 
whether tertiary structures of this kind are a more general 
feature of (-1) frameshift sites. A large number of 
retroviruses, related retrotransposons (Jacks et al., 1988b) 
and a luteovirus (Miller et al., 1988) are known or sus- 
pected to utilize (—1) frameshifting. We have examined the 
known or suspected sites of such frameshifting (those 
listed and referenced in Jacks et al., 1988b) by predicting, 
with the aid of a computer program, the secondary struc- 
ture of the RNA at the appropriate regions and then look- 
ing for potential tertiary base pairing interactions in the 
downstream sequences. We found that the “fold” program 
of Jacobson and colleagues (1984) predicted hairpin 
loops immediately downstream of the slippery sequences 
at all the sites except in the retrotransposon 17.6. We 
scanned the region downstream of the predicted stem- 
loops for sequences that were complementary to the ioop 
nucleotides, setting 50 nucleotides as the maximum dis- 
tance scanned and 5 nucleotides as the minimum size of 
the complementary region. The result obtained was rather 
striking: 14 out of the 22 sequences examined appear to 
contain the potential for pseudoknot formation. The num- 
ber of complementary nucleotides was between 5 and 8, 
and these were located between 3 and 32 nucleotides 
from the base of the upstream stem-loop. Shown in Fig- 
ure 7 are the potential interactions at the RSV gag/pol, 
MMTV gagipro and pro/po! overlaps and some examples 
from cases where frameshifting is suspected to occur, 
with the complementary loop and downstream nucleo- 
tides highlighted. In those systems where frameshifting 
has been confirmed (RSV, MMTV, IBV, and HIV-1), only 
HIV-1 did not contain a suitable downstream sequence. 
Madhani et al. (1988) and Wilson et al. (1988), however, 
have observed recently that frameshifting in HIV-1 ap- 
pears to occur in the absence of any recognizable RNA 
structures at the frameshift site, so this may not be unex- 
pected. In addition to HIV-1, seven of the suspected sites 
did not show potential for the formation of tertiary RNA 
structure within the limits of this simple examination. 
Nevertheless, the possibility remains that tertiary RNA in- 
teractions may be a common feature of (—1) frameshift 
sites. In the case of RSV, it has been shown (Jacks et al., 
1988b) that a deletion which left the stem-loop intact, but 
did not include downstream RSV sequences, had reduced 
frameshifting efficiency. However, when an additional 22 
downstream nucleotides were included, frameshifting 
was restored to approximately wild-type efficiency. As can 
be seen in Figure 7, the additional sequences include an 
eight nucleotide stretch (UGUAGCGC) with full com- 
plementarity to a region in the loop of the RSV hairpin, 
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Figure 7. Potential Tertiary RNA Structures at Other (—1) Frameshift Sites 


The figure shows the RNA secondary structures predicted by the “fold” program (Jacobson et al., 1984) at a number of retroviral gene overlaps 
where frameshifting is known (RSV, MMTV) or suspected to occur (Visna, HTLV1, BLV). Nucleotides (ntds) that could participate in base pairing 
interactions between the loop and downstream regions are highlighted. The known or suspected “slippery” sites are boxed. The nucleotide sequences 
were taken from the references cited in Jacks et al. (1988b). 


which might therefore interact through base pairing. In- carrying bacteria with helper bacteriophage R408 (Russel et al., 1986). 
troduction of single and compensatory base changes in The orientation of this fragment was such that the noncoding strand 


. ‘ : . : of pFS7 was packaged into R408 phage particles. The influenza-IBV- 
this region, through site-directed mutagenesis, should re- influenza junctions were confirmed by dideoxy nucleotide sequencing 


solve this question. (Sanger et al., 1977) of single-stranded pFS7. Both pFS6 and pFS7 
were designed such that ribosomes could translate the length of the 
PB1 mRNA following a frameshift event within the inserted IBV region 
Experimental Procedures (see Figure 1). Plasmids were maintained in E. coli JM101 (Yanisch- 


Perron et al., 1985). 
Construction of Plasmid pFS7 


Plasmid pFS1 (Brierley et al., 1987) was digested with Ddel and a 230 Site-Specific Mutagenesis of pFS7 

bp fragment containing the F1/F2 overlap region isolated (IBV se- A procedure based on the method of Kunkel (1985) was employed to 
quence information from position 12,286 to 12,509 bp; Boursnell et al., prepare site-specific mutations within the IBV frameshift region. Plas- 
1987). Following end-filling with E. coli DNA polymerase | Klenow en- mid pFS7 (or mutant derivatives) was transformed into E. coli strain 
zyme, the fragment was inserted into Sspl partially digested plasmid RZ1032 (dut-ung’) (Kunkel, 1985) and single-stranded, uracil- 
pST1+ (Digard et al., submitted) to produce pFS6. pST1+ comprises containing, pFS7 DNA isolated following superinfection with bacterio- 
the influenza virus A/PR8/34 PB1 gene (Young et al., 1983) inserted phage R408. Phosphorylated mutagenic oligonucleotides (5 pmol) 
into the Bglll site of plasmid pSP64-T (Krieg and Melton, 1984). Thus were annealed to this template (0.1-0.5 yg) in a 10 ul reaction mixture 
pFS6 consists of pSP64- T/PB1 with the IBV frameshift region inserted containing 10 mM Tris-HCl (pH 8), 10 mM MgClz, 50 mM NaCl, and 1 
at position 1162 in the PB1 gene, such that both the F1 and F2 ORF mM dithiothreitol (OTT) at 55°C for § min. After cooling at room temper- 
were in frame with the flanking PB1 sequences, with F1 in frame 1 and ature (10 min), the primer was extended by addition of dATP, dCTP, 
F2 in frame 3. Plasmid pFS7 (see Figure 1) was constructed by insert- dGTP, and dTTP (0.5 mM each); ATP (0.5 mM), E. coli DNA polymerase 
ing a 519 bp Rsal fragment from pEMBLS8 (Dente et al., 1983) into the | Kienow enzyme (6 U, Boehringer Mannheim) and T4 DNA ligase (2.5 
unique Pvull site of pFS6. This fragment included the intergenic region U, Bethesda Research Laboratories) to a final reaction volume of 20 
of filamentous bacteriophage fl (Dotto et al., 1981) enabling single- ul after readjusting the concentration of Tris-HCI, MgCl2, and DTT to 


stranded pFS7 DNA to be generated following infection of plasmid- 40 mM, 75 mM, and 3 mM, respectively. The extension-ligation reac- 
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tion was for 2 hr at room temperature, and then for an additional 2 hr 
after supplementing the reaction with additional Klenow (6 U) and li- 
gase (2.5 U). Portions of the reaction were transformed into E. coli 
JM101, and single-stranded templates prepared from transformants by 
R408 superinfection. Mutants were identified by dideoxy sequencing. 
The efficiency of mutagenesis was between 20% and 70%. 


Preparation of DNA Template, Transcription and 

Translation of mRNA 

Plasmids for transcription were prepared by the alkaline lysis 
minipreparation method (Birnboim and Doly, 1979), linearized by 
digestion with Smal, and transcribed as described previously (Brierley 
et al., 1987). Messenger RNAs were translated in rabbit reticulocyte ly- 
sates as described before (Brierley et al., 1987). Transcripts were trans- 
lated in Xenopus oocytes as described by Colman (1984). Individual 
cocytes were microinjected with approximately 50 ng mRNA, and la- 
beled with [°S]methionine (10 Ci) from 2-6 hr following injection. 
Translation products were analyzed on 10% SDS-polyacrylamide gels 
according to standard procedures (Hames, 1981). The relative abun- 
dance of nonframeshifted or frameshifted products on the gels was es- 
timated by either scanning densitometry or by measuring the radioac- 
tive content of excised gel slices. In each case, a correction was made 
to account for the differential methionine content of the products. 


Antisera against Defined Regions of the Influenza Virus 

PB1 Protein 

Rabbit antisera against defined regions of the PB1 protein were pre- 
pared by immunization with PB1 fragments expressed as fusion pro- 
teins with B-galactosidase (Stanley and Luzio, 1984). Antisera 1 (ast) 
and 4 (as4) covered the regions encoded by nucleotides 150-1110 and 
1464-2256, respectively. An antiserum raised against a region of the 
influenza A/PR8/34 PA protein by a similar procedure was used as a 
control. Radioimmune precipitation of reticulocyte translation products 
was performed as described (Brierley et al., 1987). 
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